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Abstract 



The S-adic conjecture claims that there exists a condition C such that a sequence 
has a sub-linear complexity if and only if it is an S'-adic sequence satisfying Condition 
C for some finite set S of morphisms. We present an overview of the factor complexity 
of S'-adic sequences and we give some examples that either illustrate some interest- 
ing properties or that are counter-examples to what could be believed to be "a good 
Condition C" . 

1 Introduction 

A usual tool in the study of a sequence (or an infinite word) w over a finite alphabet A is 
the complexity function p w (or simply p) that counts the number of factors of each length n 
occurring in the sequence, i.e., p w (n) = Card({w 6 A* | \u\ = n and 3x 6 A*,y e A N : w = 
xuy}) (see Chapter 4 of [BRIO] for a survey on complexity function). The set of factors of 
length n of w is denoted by Fac n (w) and Fac(w) = IJngN F ac n(w). The complexity function 
p w is clearly bounded by (Card(A))™, n e N, but not any function can be a complexity 
function. As an example, it is well known (see [MH40]) that either the sequence is ultimately 
periodic (and then p w (n) is ultimately constant), or its complexity function grows at least 
like ri + 1. Non-periodic sequences with minimal complexity p w ( n ) = n + 1 for all n exist. 
They are called Sturmian sequences and a large bibliography is devoted to them (see Chapter 

2 of [Lot 02] and Chapter 6 of [Fog02] for surveys on these sequences). 

There is a huge literature about sequences with a low complexity. Indeed, see for in- 
stance [Abe03, AB07, A1194, AR91, Cas96, Cas97, Fer95, Fer99, GJ09, Kos98, Rot94]. By 
"low complexity" we usually mean that "the complexity is bounded by a linear function". 
Moreover, many well-known families of sequences can also be indefinitely desubstituted with 
a finite number of morphisms. Formally, an S-adic sequence is defined as follows. Let w be 
a sequence over a finite alphabet A. If S is a set of morphisms (possibly infinite), an S-adic 
representation of w is given by a sequence (a n : A* n+1 — > A* n ) ng N of morphisms in S and a se- 
quence (a n ) n£ N of letters, a* e Ai for all i, such that 1 A = A, w = lim^+oo <Jo°~i 4 4 4 a n( a n+i) 
and lim„^ +00 \cr <j 1 ■ ■ ■ cr n (a n+ i)| = +oo, where a^ +1 is the sequence only composed of occur- 
rences of a n+ i. The sequence (a„) ne N G >S N is the directive word of the representation. In 
the sequel, we will say that a sequence w is S-adic if there exists a set S of morphisms such 
that w admits an S'-adic representation. Observe that in most cases, we deal with finite sets 
S of morphisms. 

An open problem is to determine the link between being an S'-adic sequence and having 
a sub-linear complexity (see [AR91, Fer96, Lerl2b]). This problem is called the S-adic 
conjecture. 

Conjecture 1.1 (S-adic conjecture). There exists a condition C such that a sequence has a 
sub-linear complexity if and only if it is an S-adic sequence satisfying Condition C for some 
finite set S of morphisms. 

1 The topology over A N is the classical product topology of the discrete topology over A. 
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It is clear that we cannot avoid considering a particular condition since there exist some 
purely substitutive sequences with a quadratic complexity. 

In this paper, we present an overview of the factor complexity of S"-adic sequences and 
we give some examples that either illustrate some interesting properties or that are counter- 
examples to what could be believed to be "a good Condition C" . 

In all what follows, we consider that alphabets are finite subsets of N and if a : A* — > B* 
is a morphism with A = {0, 1, . . . , k}, we write a = [o"(0), . . . , o~(k)]. The following example 
is classical when considering S-adic sequences. 

Example 1.2. Let us define the four morphisms Rq, R\, L and L\ over {0,1} by Rq = 
[0,10], R x = [01,1], L = [0,01] and L x = [10,1]. Since the work of Morse and Hed- 
lund [MH40], it is well known that for any Sturmian sequence w, there is a sequence (k n ) nE ^ 
of integers such that 

w= lim L^R^L^R^L^R^ 5 ■■■L k 1 4n+2 R k 1 in+3 (0 u) ). (1) 

n— >+oo 

It is important to notice that, when we talk about an .S-adic sequence, the corresponding 
directive word (a n ) ne N £ S** is always implicit (even when it is not unique). Indeed, for a 
given set 5* of morphisms, we will see that two distinct 5-adic sequences can have different 
properties depending on their respective directive words. 

2 Comparison between morphic and -S-adic sequences 

The aim of this section is to compare morphic sequences with S'-adic sequences. In partic- 
ular, we show that the factor complexity of morphic sequences is rather restricted and can 
be dependent on some combinatorial criteria although it is not the case at all for S"-adic 
sequences. 

2.1 Morphic and purely morphic sequences 

Purely morphic sequences correspond to S"-adic sequences with Card(S') = 1. If S = {cr}, we 
then have (Jqo~io~2 ■ ■ ■ = ooo ■ ■ ■ = a w . In that case, the complexity functions that can occur 
have been completely determined by Pansiot in [Pan84]. Indeed, he proved that for purely 
morphic sequences w = cr w (a) = \im n ^ +OQ a n (a UJ ) with o non-erasing (i.e., a(b) is not the 
empty word for all letters b), the complexity function p w (n) can have only five asymptotic 
behaviors that are 2 0(1), 6(n), 6(nlogn), O(nloglogn) and Q[n 2 ). Moreover, when the 
sequence w is aperiodic, Morse and Hedlund proved that its complexity function cannot be 
0(1) (see [MH40]). Then, Pansiot proved that the class of complexity of the sequence only 
depends on the growth rate of the length of the images. 

Definition 2.1. Recall that a morphism a : A* — > A* is said to be everywhere growing 
if it does not admit bounded letter, i.e., letter b such that lim n ^ +00 |cr n (6)| < +oo. We let 
A<b,cf (or A<s when no confusion is possible) denote the set of bounded letters of a. By 

2 f(n) G Q(g(n)) if 3d,C 2 > 0, n Q Vn > n \C x g{n)\ < \f(n)\ < \C 2 g(n)\. 
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opposition, a non-bounded letter is called a growing letter. Since for all letters a, we have 
|cr n (a)| G 9(n aa /3") for some a a in N and f3 a > 1 (see [RS80]), any everywhere growing 
morphism satisfies exactly one of the following three definitions: 

1. a morphism o : A* — > A* is quasi-uniform if there exists > 1 such that for all letters 
a G A, |a n (a)| G 9(/3 n ); 

2. a morphism a : A* — > A* is polynomially diverging if there exists /3 > 1 and a function 
a : A ->- N, a ^ 0, such that for all letters a e A, \a n (a)\ e <d(n a ^/3 n ); 

3. a morphism a : A* A* is exponentially diverging if there exist a a , a 2 G A, a^, a 2 G N 
and /3i,/3 2 > 1 with ft ^ /3 2 such that for each i G {1,2}, |a n (ai)| G 6(^/3"). 

Theorem 2.2 (Pansiot [Pan84]). Let w = cr^a) fre a purely morphic sequence with a non- 
erasing. 

1. If a is everywhere growing and 

i. quasi-uniform, then 3 p w (n) G 0(n); 

ii. polynomially diverging, then p w (n) G G(nloglogn); 
Hi. exponentially diverging, then p w (n) G G(nlogn). 

2. If cr is not everywhere growing and if there are infinitely many factors of w in A%, 
then p w (n) = 9(n 2 ). 

3. If a is not everywhere growing and if there are only finitely many factors ofw in A^, 
then there exists a purely morphic sequence T w (b) with r :£?*—?• B* everywhere growing 
and a non-erasing morphism A : B* — > A* such that w = A(r w (6)). In this case, we 
havep w (n) G Q(p T ^( b )(n)). 

One could regret that Theorem 2.2 only holds for non-erasing morphisms. However, the 
following result states that when the morphism is erasing, one can see the obtained purely 
morphic sequence as a morphic sequence (i.e., an image under a morphism of a purely 
morphic sequence) with non-erasing morphisms. The result is due to Cobham [Cob68] and 
has been recovered later by Pansiot [Pan83]. It can also be found in Cassaigne and Nicolas's 
survey [CN03]. 

Theorem 2.3 (Cobham [Cob68] and Pansiot [Pan83]). Ifw is a morphic sequence, it is the 
image under a letter-to-letter morphism of a purely morphic word o" w (a) with a a non-erasing 
morphism. 

Theorems 2.2 and 2.3 show that to compute the complexity function of a purely morphic 
sequence, it is sometimes necessary to see it as a morphic sequence. It is therefore natural 
to be interested in the complexity function of such sequences. By definition, it is obvious 
that any purely morphic sequence is morphic. The converse is known to be false since at 
least 1980, when Berstel proved that the Arshon word is morphic but not purely morphic 

3 f(n) e 0(g(n)) if 3C > 0, n Vn > n \ f(n)\ < \Cg(n)\. 
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[Ber80] (other examples can be found in the litterature, as for instance, the result by Seebold 
showing that the unique, up to letter permutation, binary overlap-free word which is a fixed 
point of a morphism is the Thue-Morse word [See85]). 

Moreover, not only the class of morphic sequences strictly contains the class of purely 
morphic sequences, but also the asymptotic behaviors of the complexity functions are dif- 
ferent. Indeed, Example 2.4 shows that the classes of complexity given by Pansiot are not 
sufficient anymore. 

Example 2.4 (Deviatov [Dev08]). Let w be the morphic sequence r(o"^(0)) where a and r 
are defined by 



a 



1 ^ 12 

2 ^ 23 
3^3 



and o : < 



(0^0 

1 i-> 1 
2^2 
3^2 



We have p w G Q(riy/n). 

Other examples can be found in [Pan85]. Indeed, for all k > 1, Pansiot explicitly built 
a morphic sequence w whose complexity function satisfies p w (n) G Q(ny/n). Consequently, 
the number of different asymptotic behaviors for the complexity function of morphic se- 
quences is at least countably infinite. However, the behaviors Q(n^/n) seem to be the only 
new behaviors with respect to purely morphic sequences. Indeed, in [Dev08] Deviatov proved 
the next result and conjectured an equivalent result of Pansiot's Theorem (Theorem 2.2) for 
morphic sequences. 

Theorem 2.5 (Deviatov [Dcv08]). Let w be a morphic sequence. Then, either p w (n) G 
0(n 1+ fc) for some fegN*, or p w (n) G O(nlogn). 

Conjecture 2.6 (Deviatov [Dev08]). The complexity function of any morphic sequence only 
adopts one of the following asymptotic behaviors: 0(1), O(n), 0(nloglogn), 0(nlogn), 
0(n 1+ fe) for some fceN. 

In particular, Theorem 2.5 implies that the highest complexity that one can get is the 
same for morphic sequences and for purely morphic sequences. This can be explained by the 
following result. 

Proposition 2.7 (Cassaigne and Nicolas [CN03]). Let w be a one-sided sequence over A 
and a : A* — > B* be a non-erasing morphism. If M = max ag ^ |a(a)| ; for all n we have 
Pa{w)( n ) < Mp w (n). Moreover, if w is purely morphic and a is injective, then p a {w){n) G 
0(p w (n)). 

The previous discussion shows that the factor complexity of morphic sequences is rather 
constrained. To conclude this section, we give some examples of how some additional com- 
binatorial criteria can even more restrict it. 

A first well-known fact is that if a purely morphic sequence is k-power-free, i.e., it does 
not contains any factor of the form u k , then its factor complexity grows at least linearly and 
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at most like nlogn (see [ER83]). We will consider a similar criterion on S-adic sequences in 
Section 3.5. 

Another such criterion is the uniform recurrence of the sequence, i.e., any factor occurs 
infinitely often and with bounded gaps in w. For morphic sequences, this implies that 
the complexity is sub-linear (see [NP09]). Actually, the uniform recurrence of a morphic 
sequence is even equivalent to its linear recurrence, i.e., any factor u occurs infinitely often 
and with gaps bounded by K\u\ (see [Dur98, Dur]). Furthermore, if w is a morphic and 
uniformly recurrent sequence over A, then w is a morphic sequence r(a w (a)) with o a 
primitive morphism (this result was already proved in [Dur98] in the particular case of 
morphic sequences ^(^(a)) with ip non-erasing and ip everywhere growing). The following 
result also provides an algorithm to check whether a purely morphic sequence is uniformly 
recurrent. 

Theorem 2.8 (Damanik and Lenz [DL06]). A purely morphic sequence w = cr w (a) is uni- 
formly recurrent if and only if there is a growing letter b G A that occurs with bounded gaps 
in w and such that for all letters c G A there is a power a k such that c occurs in o~ k (b). 

2.2 5-adic sequences 

The previous section shows that the factor complexity of morphic sequences are rather 
restricted (especially for purely morphic sequences). In particular, the class of uniformly 
recurrent morphic sequences is strictly contained in the class of morphic sequences with sub- 
linear complexity. In this section, we show that, for S-adic sequences, things are strongly 
different. A first important result is the following. 

Proposition 2.9 (Cassaigne [Fogll]). Let A be an alphabet and I ^ A. There exists a finite 
set S of morphisms over A' = A U {/} such that any sequence over A is S-adic. 

In particular, this implies that one can get any high complexity with S'-adic sequences (al- 
though for morphic sequences, the complexity is at most quadratic). Moreover, the following 
proposition implies that the set of possible asymptotic behaviors for the complexity function 
of S'-adic sequences is uncountable (although it is finite for purely morphic sequences and 
conjectured to be countable for morphic ones). See also [MM 10] for another approach to 
build sequences whose complexity is closed to a given function. 

Proposition 2.10 (Cassaigne [Cas03]). Let f : M + — > M + be a function such that 

i. lim^+oo {^- t = +oo; 

ii. f is differentiable, except possibly at 0; 

Hi. lim^+oo f'(t)tP = for some f3 > 0; 

iv . f is decreasing. 

Then there exists a uniformly recurrent sequence w over {0, 1} such that 1 log(p w (n)) ~ f{n). 
4 f( n ) ~ d(n) if Ve > 3n Vn > n \f(n)/g(n) - 1\ < e. 
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In particular, the function f(n) in the previous proposition can be taken equal to n a for 
any a with < a < 1. 

Another big difference is that the class of uniformly recurrent S-adic sequences with sub- 
linear complexity is a very short part of the class of uniformly recurrent S-adic sequences. 
Indeed, recall that the topological entropy of a sequence over an alphabet A is the real number 
h with < h < log(Card(A)) defined by 



A uniformly recurrent sequence w over an alphabet A with at least two letters a,b G A 
cannot have maximal complexity (p w (^) = Card(A) n ): since all powers a n occur in w, there 
are unbounded gaps between two successive occurrences of b in w. However, together with 
Proposition 2.9, the following result shows that, except the maximal one p w (^) = Card(v4) n , 
any high complexity can be reached by uniformly recurrent S-adic sequences. 

Theorem 2.11 (Grillenberger [Gri73]). Let A be an alphabet with d = Caid(A) > 2 and h G 
[0, log(d)[. There exists a uniformly recurrent one-sided sequence w over A with topological 
entropy h. 

The following result provides an S-adic characterization of uniformly recurrent sequences. 
A sequence is said to be primitive S-adic (with constant Sq) if Sq G N is such that for all 
r G N, all letters in A r occur in all images a r • • • a r+so (a), a G A r+SQ+ i. It is said to be weakly 
primitive S-adic if for all r G N, there exists s > such that all letters in A r occur in all 
images a r ■ ■ •cr r+s (a), a G A r+s+ i. Any primitive S-adic sequence is weakly primitive and 
Durand [Dur03] proved that any weakly primitive S-adic sequence is uniformly recurrent. 
Following a construction based on return words (see Section 3.4 for definition), one can prove 
that the converse it true. More precisely, we got 

Theorem 2.12 (Durand [Dur03], Leroy [Lcrl2a]). A sequence w is uniformly recurrent if 
and only if it is weakly primitive S-adic. 

Moreover, the directive word can be chosen to be proper and ifw does not have a sub-linear 
complexity, then S is infinite. 

The proof is similar to the proof of the following result which gives an S-adic characteri- 
zation of linearly recurrent sequences. Let recall that a sequence is said to be proper S-adic 
if all morphisms in its directive word are proper, i.e., for all n there are letters a, b G A n such 
that a n (A n+ i) C aA* n b. 

Theorem 2.13 (Durand [Dur03]). A sequence w is linearly recurrent if and only if it is 
primitive and proper S-adic with Card(S') < +oo. 

The next example shows that the linear recurrence is an even stronger condition that the 
primitivity for S-adic sequences (contrary to what holds in the morphic case). 

Example 2.14 (Durand [Dur03]). Let S = {cr, r} where a and r are defined by 



h 



lim 



log(p(n)) 



n 



a : < 1 h-> 101 




and r: 14 021 
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The sequence 



w = lim ara 2 r ■ ■ • (T ri r(O aj ) 



is primitive S-adic but not linearly recurrent. 

3 /S-adicity and sub-linear complexity 

The aim of this section is to explore some ideas that one could have about the S-adic 
conjecture. First, we present some sufficient conditions for S-adic sequences to have a sub- 
linear complexity and we show that we cannot make them weaker. Then, we give some 
counter-examples to some conditions that one could naturally believe to be sufficient to have 
a sub-linear complexity. 

3.1 The growth rate of the length have still some importance 

Durand [DurOO, Dur03], gave some sufficient conditions for an S-adic sequence to have a 
sub-linear complexity. The main condition is the one given by the following result and is 
a generalization of what exists for purely morphic sequences (see Theorem 2.2). The other 
conditions are simply consequences of it. 

Proposition 3.1 (Durand [Dur03]). Let w be an S-adic sequence with Card(S) < +oo and 
whose directive word is (cr„)„ g N with a n : A* +1 — > A* n and A the alphabet ofw. If there is 
a constant D such that for all n, 



then p w (n) < Dmax ffii6S](ieAii+I |cr n (a)|(Card(A)) 2 ra with A = U neN A n . 

Corollary 3.2 (Durand [Dur03]). Ifw is S-adic with Card(S) < oo and all morphisms in S 
are uniform, then we have p w (n) < l(Ca.rd(A)) 2 n with A = U n( zfqA n and I = max |er(a) | . 



Proposition 3.3 (Durand [DurOO]). Ifw is a primitive S-adic sequence with Card(S') < +oo 
and constant s directed by (cr n ) ngN with o n : A* n+1 — > A* n and A the alphabet of w, then 
there exists a constant D such that for all non-negative integers r, 



Corollary 3.4. Let S be a set of non- erasing morphisms and r 6 S be strongly primitive 
(i.e., for all letters a ofr(A), a occurs in all images r(b) forb G A). Any S-adic sequence for 
which t occurs infinitely often with bounded gaps in the directive word is uniformly recurrent 
and has a sub-linear complexity. 



max -j 

a,b£A n+1 | tT o ■ ■ ■ O"; 




(2) 



cr<=S,a£A(cr) 
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The everywhere growing property can be naturally transposed to S'-adic sequences. But, 
contrary to what holds in the purely morphic case, under that assumption, the condition 
given by Equation (2) is not equivalent to sub-linear complexity. Indeed, even some Sturmian 
sequences do not satisfy it (those with unbounded coefficient (fc n )neN in Example 1.2). One 
could therefore try to make that condition a little bit weaker. 

We can observe in Example 1.2 that Condition (2) is still infinitely often satisfied. Indeed, 
let us consider the directive word (r n ) neN defined by r = L^R^Li, r 2n = Lg 4n_1 i?Q 4n+1 L 1 
for n > 1 and r 2n+ i = L^ 4n+2_1 i?^ 4n+3 L for n > 0. For all n, there exist some integers i and 
j such that either r n = [0 4 1(F +1 , (PICP] or r n = [POP, POP 4 " 1 ]. With these morphisms, we 
have 

r oTl . . . Tn . . . = LfrBfrL^B* 3 ■ ■ ■ L^ 2 R\ An+3 ■■■ (3) 

and there is a constant K such that for all n, max ai b g ^ n+1 n~T7^ < K. The sequence 

(T"n)neN is called a contraction of the directive word (L^R^ 1 ■ ■ ■ ). Observe that the set S = 
{r n | n e N} of morphisms might be infinite (when (fc„)„ e N is unbounded). Consequently, it 
may be interesting to work either with infinite sets of morphisms or with contractions. 

But, Example 3.5 below shows that Proposition 3.1 is not true anymore when Card(S) = 
oo. Indeed, if we consider the contraction (cr n ) neN of the directive word of Proposition 3.6 
defined for all n > by 

On = 7 fc >, (4) 

we have |<7o • • ■ cr„(0)| = | cr ■ ■ ■ cr n (l)| for all n although the complexity is not sub-linear as 
soon as the sequence (fc n ) n£ N is unbounded. 

Example 3.5. Let fi be the Thue-Morse morphism [01, 10] and let 7 be the morphism 
[001, 1]. From Theorem 2.2 we know that the sequence 

7 W (0) = 001001 2 001001 3 001001 2 001001 4 ■ • • 

has a quadratic complexity. 

Proposition 3.6. Let (k n ) n( z^ be a sequence of non-negative integers. The sequence 

w 7)M = lim 7 feo /i7 fcl /V c > ■ ••7 fen /i(0 tJ ) 

n— >+oo 

is uniformly recurrent. Moreover, w 7)/x has an at most linear complexity if and only if the 
sequence (A; n )neN is bounded. Finally, for all n we have 

| 7 W>7V--7 fe >(0)| = | 7 WWV--7 fc >(l)l, 

and denoting 

4 = |7V>7 fc >-7S(0)|, 

we have 

Pw 7 , M (4) <44-2. 
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Before proving the result, recall that a right (resp. left) special factor in a language 
L C A* is a factor such that there are at least two letters a,b G A for which ua and ub (resp. 
au and bu) belong to L. A bispecial factor is a factor which is both left and right special. 
This definition can be extended to words and sequences w by replacing L by Fac(w). Let 
us recall the following result. 

Theorem 3.7 (Cassaigne [Cas96]). A sequence has a sub-linear complexity if and only if 
there is a constant K such that for all n, the number of right (resp. left) special factors of 
length n is less than K . 

Proof of Proposition 3.6. First, as fi occurs infinitely often in the directive word, w 7jM is 
weakly primitive {7,/i}-adic and so, by Theorem 2.12, it is uniformly recurrent. 

Now let us study the complexity depending on the sequence (k n ) ne ^. The case of a 
bounded sequence is a direct consequence of Corollary 3.4. Hence let us consider that the 
sequence (fc n ) ngN is unbounded and let us show that the complexity is not at most linear. 
Due to Theorem 3.7, we only have to prove that the number of right special factors of length 
n of w 7iM is unbounded. 

As already mentioned in Example 3.5, the fixed point 7 W (0) has a quadratic complexity. 
Consequently the number of right special factors of 7^(0) of a length n is unbounded. 

We let readers check that bispecial factors of 7^(0) are words e, 0, 1 and words 7(lt>) 
with v bispecial, and that other right special factors of 7^(0) are the suffixes of words 7(f) 
for v right special. Thus, by induction, one can state that all right special factors of length 
n of 7 aJ (0) occur in 7 n+1 (0). 

Now let us show that if u is a right special factor in 7 fcn+1 (0), then 7 fc °/i7 fcl /z • • • 7 fcn /i(ii) 
is a right special factor of w of length \u\2 q with q = Yli=o(^i + -0- Indeed, as /i(0) and 7(0) 
start with and /i(l) and 7(1) start with 1, the image of u is still a right special factor. 
Moreover, fi(u) contains exactly \u\ occurrences of the letter and n occurrences of the letter 
1, and both 7 and /x map a word with the same number of and 1 to a word of double length 
with the same number of and 1. Hence |7 fc °/i7 fel /i • • ■ r y kn fj J (u)\ = \u\2 q with q defined as 
previously. Now, if u and v are two distinct right special factors of length m of 7^(0), then 
7 fc °/i7 fcl /i • • ■■y kn fi(u) and 7 fc °/i7 fcl /i • • -7 fcn /i(f) are two distinct special factors of length m2 q 
of w. As the number of right special factors of a given length of 7^(0) is unbounded, and as 
the sequence (fc n )neN is unbounded, the number of right special factors of a given length of 
w is also unbounded which concludes the first part of the proof. 

The last step is to show that, for all integers £ n , we have Pv, J£ n ) — 4£n- F° r a ^ non- 
negative integers n, we already know that 

| 7 fc V7 fe V • • ■ 7 fc "M0) I = |7 fc V7 fc > ■ • • 7 fc "Ml) I = 4 = 29 

with q as defined previously by ^" =0 (&i + 1). Consequently, all factors u of length £ n are 
factors of |7 fe °/x7 fcl /i- • ■ , y kn n(v)\ for some word v of length 2. As there are only 4 possible 
binary words of length 2 and as there are less than £ n + 1 distinct factors of length i n in 
a word of length 2£ n , we obtain p w ^A£ n ) < 4£ n + 4. However, among the 4£ n + 4 words, 
both words 7 fc °/i7 fcl /i • ■ ■ 7 fcn /i(0) and 7 fc °/i7 fcl /i • • ■ 7 fc "/i(l) have been counted 4 times, hence 
Pw 7 , M (4)<4£„-2. □ 
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3.2 The condition C of the conjecture could not only concern the 

set S 

Since it seems hard to make the condition of Equation (2) weaker, another idea is to determine 
new sufficient conditions that are independent from it. A first attempt in this direction was 
proposed by Boshernitzan, asking whether if S contains only morphisms for which the fixed 
points (if exists) of their powers have sub-linear complexity, then any S-adic sequence has a 
sub-linear complexity. 

But, Boshernitzan eventually provided the following counter-example to that conjecture. 
Since we did not find any detailed proof of it, we provided it. 

Example 3.8. Let 7 and E be the morphisms over {0, 1} respectively defined by [001, 1] and 
[1,0]. Observe that both morphisms jE and E^ are primitive. Consequently, they admit a 
power whose fixed points have sub-linear complexity. We consider the sequence 

w,, E = lim ^E 1 2 E 1 3 E--- 1 n - 1 E 1 n {0 u '). 

n— >+oo 

Proposition 3.9 (Boshernitzan). The sequence w 7i £ is S-adic for S = {•yE,E'j}, is uni- 
formly recurrent and does not have a sub-linear complexity. 

Proof. First, by definition, w 7i £ is indeed .S-adic for S = {■yE,E r y}. 

Next, the composition 7 o E o 7 is strongly primitive and occurs infinitely often in the 
directive word of w 7j e. It is therefore a consequence of Theorem 2.12 that w 7) _e is uniformly 
recurrent. 

To prove that w 7j e does not have a sub-linear complexity, by Theorem 3.7, it is suf- 
ficient to prove that the number of its right special factors of length n is unbounded. 
For this purpose, let us introduce notation. For all k G N*, let us define the morphism 
T k = jEj 2 E ■ ■ ■ ^ k ~ x E^ k E and, for all k e N, the sequence 

w {k) = lim - f k+1 E-f k+2 E----f k+n - 1 E 1 k+n (0 u; ). 

We then have w 7j £ = = r^w^), for all k > 1. For all i > 1 we also consider the 
word Ui = 7*(10) = l7*(0). As 100 and 101 are factors of E(w (fc+1) ), and as 100 and 001 are 
factors of -f k - j ^(E(w^ k+ ^)) for all j with 1 < j < k (observe also that 7(100) = 7(10)001 
and 7(001) = 007(10)1), we can deduce that Ui is a right special factor of w^ k ' for all i with 
1 < i < k + 1. As words T^O) and T^l) start with different letters, for all integers i such 
that 1 < i < k + 1, the word T^Ui) is a right special factor of w 7j e, and so are all of its 
suffixes. 

For 1 < i < k + 1, word Ui end with 1001*, so that the longest common suffix between 
and Ui + i is 1*. It follows that the longest common suffix between r^itj) and Yk{ui + i) is the 
word pkT k (V) where p = e and for k > 1, 

p^ir^i 2 ^! 3 )--^^). 

This is indeed a consequence of the fact (we let readers verify it) that for any word u 
containing at least one occurrence of and 1, the word T k (u) ends with ap k where a is the 
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last letter of u when k is even, and where a is the opposite letter to the last letter of u when 
k is odd. 

From what precedes w 7vB has at least n{k) right special words of length |p fc r fc (l fc )| + 1, 
where n(k) denotes the number of integers i between 1 and k + 1 such that |r(uj)| > 
\p k T k (l k )\. Next fact allows to estimate n(k). 

Fact: With f(k) = ^+ 2 , for all k > 2, 

1. for all 1 < i < k+ 1, \T k ( Ui ) \ = 2 i 2 / W, 

2. |p fc r fc (l fc )|<A; 2 2^). 

Proof of the fact. 1. Let i be between 1 and k + 1. By induction, one can verify that 
\Y(0)\o = 2 i and ^(O^ = 2* - 1. As = l7*(0), we have [^| = Mi = 2\ Observe that 
for any word v such that |u|o = \v\i, we have \j(v)\ = 2\v\ and |7(f)|o = |7(*0|i = M and 
\E{v)\ = \E(v)\! = \v\ . 

Thus, as T k = jE^ 2 E ■ ■ ■ j^E^E, \T k ( Ui )\ = 2 i+1 2^Ui = 2 i+1 2 M ^ M = 2*2^*). 

2. Before estimating \p k T k (l k )\, we need an estimate of |r fe (l)|. 

|r fc (i)| = | 7 £7 2 £-- -7*^(1)1 

= |7 /£ (0)| |7 J E 7 2 J E-- -7^(0)1 + ^(O)^ \ 1 E 1 2 E--- 1 k ~ 1 E(l)\ 
= 2 k (1^(01)1 - \r k ^(i)\) + (2 k - 1) 1^(1)1 

k 2 +k+2 , , . . 11, \ 

= 2 — 2 — _ |r fc _ 1 (i)| < 2^t fc >. 

Observe that |r fc (l fc )| = fc|r fc (l)|. One can verify that, for all j with 1 < j < k, Tj^V) < 
T k (l k ). Moreover as k > 2, (lr^l 2 )] < |r fe (l fc )|- 

Hence \p k T k (l k )\ = {lT^l 2 )] + £- =3 {V^V)] + \T k (l k )\ < k\T k (l k )\ < k 2 f(k). □ 

To end the proof of proposition 3.9, it suffices to notice that for all i such that log 2 k 2 < 
i < k + 1, |rfc(uj)| > \p k T k (l k )\, so that n{k) > k + 1 — |~log 2 A; 2 ]. In other words, the 
number of right special words of w 7i g of length \p k Y k {l k ) \ + 1 is unbounded, which shows by 
Theorem 3.7 that w 7i e does not have a sub-linear complexity. □ 

Remark 3.10. The previous result is even stronger than just considering sets S of morphisms 
with fixed points of sub-linear complexity. Indeed, the sequence also has bounded partial 
quotients, i.e., all morphisms occur with bounded gaps in the directive word (over {7-E, -E7}). 

An opposite question to the one previously answered is to ask whether S'-adic sequences 
can have a sub-linear complexity when 5* contains a morphism that admits a fixed point that 
does not have a sub-linear complexity. Example 3.5 positively answers that question and we 
can prove even more. Indeed, the following example provides a uniformly recurrent 5-adic 
sequence with sub-linear complexity such that a "bad morphism" occurs with arbitrary high 
powers in its directive word. 

Example 3.11. Let us consider the morphisms 



P : < 1 1112 




and M : < 1 h-> 1 
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and the sequence 

wbm= iim m ;-.u ? 3 m--- r no--). 

n— >+oo 

Note that /3 is not everywhere growing, its set of bounded letters is A<g t p = {2} and all words 
in A^g g are factors of /3^(0). Hence by Theorem 2.2(2), ppw(p) is in 6{n 2 ). 

Proposition 3.12. The sequence Wp^ defined just above has a sub-linear complexity. More 
precisely, for all n we have p(n + 1) — p(n) G {1, 2}. 

The proof of this proposition will use the link, proved by Cassaigne [Cas97] between 
bispecial factors and the first difference s of the complexity function. Given an infinite word 
w, we recall that the bilateral order of a word u is the value 

m(u) = Card(Fac(w /3iA/ ) n AuA) - Card(Fac(w /3iM ) n Au) - Card(Fac(w /9iM ) n uA) + 1. 

When m(u) > 0, u is a bispecial factor of w called a strong bispecial factor, and when 
m(u) < 0, u is also a bispecial factor of w called a weak bispecial factor. 

Let s w (or simply s) be the function defined for n > by s(n) = p(n + 1) — p(n). We 
have s(0) = 1 and Cassaigne proved (see [Cas97]) that 

s(n + 1) — s(n) = m(u) (5) 

uGFac n (w) 

In next useful result, Bq is the identity morphism and, for k > 0, Bk = M(3M(3 2 ■ ■ ■ Mf3 k . 

Lemma 3.13. A word u is a strong bispecial factor ofwp t M if and only if u = BkM(3 l (l) 
for some k > and some i in {0, . . . , k}. 

A word u is a weak bispecial factor of Wp t M if and only if u = f^M/^lOl) for some 
k > and some i in {0, . . . ,k}. 

Proof. For all integers k > 0, let be the sequence whose directive word is (M, /3 k+1 , 
M, /3 k+2 , ...). Note that, for k > 0, w p>M = B k (w^). 

All words 010, 011, 110, 111 occur in w ( fc+1 ) and all words 010, 011, 120, 111 occur in 
/ gi( w ( fc + 1 )) f or j > i Thus M(3 1 {1) for i in {0, 1, . . . , k + 1} are strong bispecial factors of 
w (fc+i) (observe M/3 i (120) = M/3 i (210) contains the factor 1M/?(1)0). As B fc (0) starts and 
ends with 0, and as -23^(1) starts and ends with 1, B k M(i % {\) for i G {0, 1, . . . , k} are strong 
bispecial factors of wp )M . 

Now let i G {0,1,..., k}. Words 0M^(101)1 and 1M/?(101)0 are factors of w (fc) (re- 
spectively factors of M/3 i+1 (01)1 and lM/3 i+1 (10)). So 0B fc (M^(101))l and lS fc (M/3 i (101))0 
are factors of Wp,M- One can verify that the existence of factor 0Bk(M (3 l (101))0 in w^m 
would imply the existence of factors M(0/3 J (101)0) in /3( fc+1 ^')(w (fc+1) ) for < j < i, and 
of factor 01010 in /3( fc+1-4 )(w( fc+1 )) which is impossible. Similarly the existence of factor 
lB fe (M/3 i (101))l in wp tM would imply the existence of 11011 in /3( k+1 ~ i ) (w^ +1 ">) which is 
impossible. 

We now prove that there are no other strong and weak bispecial factors. 
Let u be a bispecial factor of Wp t M- If u contains no occurrence of the letter 0, u = l n for 
some n > 0. As ^ contains arbitrarily large powers of 1 and infinitely many occurrences 
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of 0, words 1 1 " 1 , 01 n l, ll n are all factors of wp t M- When 01 n is a factor of w^ ^, one 
can verify that l n = BkMf3 l (l) for some k > and some integer % in {0, . . . , k}. Thus \ n 
is a strong bispecial factor of if and only if l n = B k M^ l (l) for some > and some 

integer i in {0, . . . , k}. 

Note that any factor of w^m containing only one occurrence of is on the form 01 n , 
101 n , l n or l n 01 for some n > and is not bispecial. 

Thus assume u contains at least two occurrences of 0. From the shape of the morphisms 

there is a unique integer k and a unique sequence of words 



V,Po, S , Pi, Sl, • • • ,Pk-l, Sfc-i 

over {0, 1} such that 

u = p B 1 (pi)B 2 (p2) ■ ■ ■ B fe -i(pfe-i)Bjfc(f )Sfc_i(s fc _i) • ■ ■ B 2 (s 2 )B 1 (s 1 )s , 

where v is a proper factor of M/3 fc+1 (010) and for all i, < i < k — 1, the word (resp. 
Sj) is a suffix (resp. prefix) of £>j + i(0) or of but is neither equal to £>j + i(0) nor to 

B i+1 (l). 

Since for all i, Bi(0) and Bi(l) do not have any common prefix or suffix, the factor u is 
bispecial if and only if all words p , s ,pi, s 1; . . . ,Pk-i, Sk-i are empty words and v is bispecial 
in w( fc ). Moreover, v contains an occurrence of the letter 0. We also have that u is strongly 
(resp. weakly) bispecial if and only if so is v. Then, it is only a verification that the only 
bispecial factors in that contain an occurrence of and that are factors of M (3 k+1 (010) 
are the words M/9 l (101) for i e {0, 1, . . . , k}, which are weak bispecial factors. □ 

Proof of Proposition 3.12. Let (u n ) n >o be the sequence of strong and weak bispecial factors 
of w^m- Observe that for all k > 0, 

• B k M(3 k+ \l) = Bfc+i(l) = B k+1 M0°(1). 

• |B fc (M^'(l))| < |B*(M^'(101))| < \B k (Mf3 l+1 (l))\, for all i in {0, 1, ... , k}. 

Therefore by Lemma 3.13, u n is a strong bispecial factor of w@ t M if n is even, and it is a weak 
bispecial factor of wp t M if n is odd. As, for any factor, m{u) = 1 if it is strong bispecial, 
m(u) = —1 if it is weak bispecial and m(u) = otherwise, by Formula 5, for all n > 0, 

• s(\u n \ + 1) — s(|w n |) = 1 if n is even, 

• s{\u n \ + 1) — s(|-u n |) = — 1 if n is odd, 

• s(n + 1) — s(n) = if n {|w m | | m > 0}. 

Consequently, since s(0) = p(l) — p(0) = 1, for all n > 0, we have p(n + 1) — p(n) = s{n) G 
{1,2}. □ 
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3.3 Another (easier?) version of the 5-adic conjecture 

A natural idea to try to understand the conjecture is to consider examples composed of 
well-known morphisms. For instance, one could consider the Fibonacci morphism (p = [01, 0] 
whose fixed point is a Sturmian sequence and the Thue-Morse morphism fi— [01, 10] whose 
fixed points both have a sub-linear complexity. We have 

Proposition 3.14. If S = {y?,/u} where (p and fi are defined above, any S-adic sequence is 
linearly recurrent. 

Proof. Let S = {/i, ip}. Let w be an S-adic sequence directed by (c n ) ne ^ and, for all k EN, 
let w( fc ) be the S-adic sequence directed by (cr n ) n > fc . 

By definition of (p and fi, as E {y?(w( fc+1 )), yu(w^ fc+1 ^)}, word 111 does not occur in 
for all k > 1. Note also that 000 does not occur in /i({0, 1}*), and 0000 does not occur 
in ^(w^ 1 -*) since otherwise 111 should occur in w^ k+1 \ Thus both words 0000 and 111 does 
not occur in for all k > 0. This implies that the gap between two occurrences of 01 
and 10 is at most 5 (7 is the maximal length of a shortest word on the form lOiilO or on the 
form OlwOl). 

Now observe that 11 occurs in only in the image of ex*; (10) (and only if = /i). 
This implies that the gap between two occurrences of 11 is at most 12. 

Finally observe that 00 occurs in only in the image of o"i(10) or in the image of 
<Tfc(ll) (when <7fc = <p). As factors of that start with 10 or 11, end with 10 or 11 and do 
not contain any other occurrences of 10 or 11 have length at most 7, the gap between two 
consecutive occurrences of 00 is at most 12. 

We have just proved that the length of the largest gap between two occurrences of a word 
of length 2 in all words is bounded. By choice of S, w is primitive S-adic. Hence by a 
result of Durand [Dur03], w is linearly recurrent. □ 

Let us say that a set of morphisms S is good-adic if all S-adic sequences have a sub-linear 
complexity. Proposition 3.14 provides an example of such a set. Many other examples are 
known as: 

• any singleton {/} with / a morphism with fixed points of linear complexity; 

• the set of Sturmian morphisms {[01, 1], [10, 1], [0, 01], [0, 10]}; 

• the set of episturmian morphisms over an alphabet A: S = {L a , R a | a E A} with for 
all a E A, L a (a) = R a (a) = a, L a (b) = ab and R a {b) = ah for b ^ a; 

• any finite set S that contains only uniform morphisms (see Corollary 3.2); 

• any finite set S that contains only strongly primitive morphisms (see Corollary 3.4). 

Note that if S is good-adic then, for any morphism / in S admitting an infinite fixed 
point, this fixed point must have a sub-linear complexity. But this necessary condition is 
certainly not the only one. 

Question 3.15. What are good-adic sets of morphisms? 
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3.4 About return words 



If u is a factor of a sequence w, a return word to u is a word r such that ru is a factor of w 
that admits u as a prefix and that contains only two occurrences of u. Return words to u in 
w actually correspond to the gaps between two occurrences of u in w. 

Durand [Dur98] proved that primitive morphic sequences, i.e., sequences defined as the 
image, under a morphism, of a fixed point of a primitive morphism can be characterised 
using return words. Hence, it is quite natural to ask whether such a result exists for S-adic 
sequences with sub-linear complexity 

There exist many examples of S'-adic sequences w with sub-linear complexity for which 
there exists an integer K, such that all factors of w have at most K return words. For 
instance, Vuillon [VuiOl] proved that Sturmian sequences are exactly sequences whose factors 
have exactly two return words. Justin and Vuillon [JVOO] extended this result to the family 
of Arnoux-Rauzy sequences (also called strict episturmian sequences) whose factors have 
exactly m return words with m the cardinality of the alphabet. Balkova, Pelantova and 
Steiner [BPS08] characterized sequences whose factors have all the same number of return 
words showing they have all a sub-linear complexity. Improving a result by Ferenczi [Fer96], 
Leroy [Lerl2a] proved that sequences whose complexity verify ultimately 1 < p(n+l)— p(n) < 
2 are S-adic and infinitely many of their factors admit two or three return words. 

Despite of previous example, next proposition show that there exist S"-adic sequences with 
sub-linear complexity such that any long factor has many return words ( "many" depending 
on "long"). 

Example 3.16. Let us consider the three morphisms p, a and s defined by 



p : < 



'a\ \-y 
a 2 M- 1 
6i ^ 1 

[b 2 1 y o 



a : < 



ai i — y a\a 2 a\ 

CL2 I — y a2<l20'2 

b\ i — y bib 2 bi 
b 2 i y b 2 b 2 b 2 



s : 



t-> ai 

1 ^ bx 



For all n > 1 we let ir n denote the morphism pa n s\ we have 

r 2 n - 1 -i 



7T, 



^ 01 eo 01 ei 01 e2 • • • 

1 ^ 10 eo 10 ei 10 e2 --- 





where e is the fixed point of the morphism Exp defined over the infinite alphabet {3 n | n e N} 
by 

Vn G N, Exp(3 n ) = 13 n+1 , 



i.e., 



e = 1313 2 1313 3 1313 2 1313 4 1313 2 1313 3 1313 2 13 



Now let us consider the sequence 



w n = lim 7ri7r 2 • • • ^(0^). 
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Proposition 3.17. The sequence defined above is uniformly recurrent, has a sub-linear 
complexity and for all integers k, there is a length such that all factors ofw^ of length at 
least £k have at least k return words in w n . 

Proof. The uniform recurrence is a direct consequence of Theorem 2.12. Since w is S-adic 
with S = {p, a, s}, it is a consequence of Corollary 3.2 that has a sub-linear complexity. 

Let us prove that all sufficiently long factors of w„- have many return words. For all 
k > 1, we let w( fc ) denote the sequence 

w (fe) = lim n k n k+1 ---rr n (0 UJ ). 

n— >oo 

We obviously have = and for all k > 1, w n = wi ■ ■ ■ nk(\v^ k+1 ^). We also have 
|7ifc(0)| = |vr fc (l)| = 3 fc for all k and we let £ k denote the length 

k{k + l) 

Ki7T 2 • • -7^(0)1 = |7Ti7r 2 • • -^(1)1 = 3 2 . 

Due to the shape of morphisms ~K\ , any factor u of w n of length at least equal to 24 can 
be uniquely decomposed into images 7Ti(0) and 7ri(l), i.e., there is a unique word v in 
such that u G Fac(7ri(t>)) and such that if u G Fac(7Ti(V)), then v G Fac(f'). Indeed, given 
such a factor u, either it contains an occurrence of 00 or of 11 (which uniquely determine 
how to decompose the factor), or it is equal to 7Ti(10101010) or to ti\ (01010101) (neither 
101010101, nor 010101010 occur in w( 2 )). 

Similarly, we see that, for all k > 2, any factor of of length at least equal to 7 can 
be uniquely decomposed into 7i k (0) and 7rfc(l). 

Let n be a positive integer (suppose it is large). The sequence (h)k>i is increasing so 
there is a unique positive integer k such that lk-i < n < l k . Consequently, all factors of 
length n of w, belong to Fac (tt 1 - ■ ■ n k ({0, l} 2 )). From what precedes, there is a unique 
word v in Fac (% k ({0, l} 2 )) C Fac (w^ fc ^) such that u G Fac^x • • ■ 7r fc _ 1 (f )) and any word v' 
over {0, 1} such that u G Fac ■ ■ 7r fc _ 1 (f ')) contains v as a factor. 

By unicity of v, the number of return words to u in w„- is at least equal to the number of 
return words to v in w^ fc ^ (some return words could also occur in tti ■ ■ •7Tfc_i(f)) so we only 
have to show that the number of return words to v in is at least linear in k. This will 
prove the result since k increases with n. 

First, if \v\ > 7, we have seen that there is a unique word x G {0, 1, 00, 01, 10, 11} such 
that v G FsLc(irk(x)) and such that if v G Fa,c(7Tk(y)) , then x G Fa.c(y). The number of return 
words to v in w( fc ) is at least equal to the number of return words to x in w^ +1 ^ and we let 
the reader check that it is at least linear in k. 

Let us suppose that \v\ is less than 7 and that the factor 000 occurs in v (the case 
111 G Fac(f ) is similar). Then, from the shape of 7^, v only occurs in w^ fc ^ as factor of 7Tfc(l). 
The number of return words to v in is then at least equal to the number of return words 
to 1 in \y( fc+1 ) and we let the reader check that it is at least linear in k. 

Now suppose that neither 000 nor 111 occur in v (still with \v\ < 7). Since the number of 
return words to a factor is the same as the number of return words to the smallest bispecial 
factor containing it, we only have to count the number of return words to bispecial factors 
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of w( fc ) that are smaller than 7 and that do not contain 000 neither 111 as factors. We let 
the reader check that these factors are exactly 

{0, 1, 00, 01, 10, 11, 010, 101, 0101, 1010, 01010, 01011, 11010, 10101, 010101, 101010} . 

The bispecial factors 01010, 10101, 010101 and 101010 only occur in w< fc > as factors of 7r fc (01) 
and 7Tfc(01) but are not factor of vr fc (0) neither of 7Tfc(l). Since the number of words filling the 
gaps between occurrences of 01 and 10 in w^ +1 ^ is linear in k, these bispecial factors have 
a number of return words which is linear in k too. 

We let the reader check that the other bispecial factors have a number of return words 
in which is linear in k (each return word containing a different highest power of or of 
1 depending on the bispecial factor). This completes the proof. □ 

To end with return words, let us observe that it is possible to build sequences whose 
complexity is not sub-linear and in which infinitely many factors have a bounded number 
of return words. Indeed, for instance, the sequence 7^(0) (see Example 3.5) has a quadratic 
complexity and any factor 7 n (l) admits two return words. After last observation it seems 
difficult to characterize sequences with a sub-linear complexity using return words. Never- 
theless next interesting question is open. 

Question 3.18. Let w be a sequence such that, for some integer K, all factors of w has 
at most K return words. Is it true that w is S-adic for some suitable set S and has linear 
complexity? 

3.5 What about the number of distinct powers? 

As already mentioned, it is known that the factor complexity of fc-power-free morphic purely 
sequences grows at most like n log n. In this section, we explore links between bounding the 
number of distinct exponents of factors instead of the maximal exponent of factors and the 
factor complexity. 

Let us introduce the following notation: given a language L and a word u — U\ ■ • ■ U\ u \ G L, 
Pow(it, L) is the set of all non-negative integers i such that there are some words p and s 
such that pu l s belongs to L, \p\ < \u\, \s\ < \u\, p is not a suffix of u and s is not a prefix of u. 
With the notation of previous section, we have Pow^ ■ ■ • 7T n (0), Fac(w 7r )) = {3 1 , 3 2 , . . . , 3 n }, 
which shows the existence of an S-adic sequence with an at most linear complexity that have 
factors with an unbounded number of distinct exponents. 

The previous phenomenon also holds for S-adic sequences that do not have a sub-linear 
complexity. Indeed this is the case of the S-adic sequence considered in Proposition 3.6 when 
the sequence (k n ) n€N is unbounded, as one can observe that 

Pow( 7 fc V7 fcl /i7 fc > ■ ■ • 7 fc "Ml), Fac(w 7i/ J) = {1,2,..., k n+1 + 2}. 

If it seems difficult from previous discussion to get an S-adic characterization of sequences 
with sub-linear complexity using the number of distinct powers of their factors, next propo- 
sition inspired by word w 7)/i shows that the number of distinct powers can be used to prove 
that a sequence does not have a sub-linear complexity. Recall that a word u is primitive if 
it is not a power of a smaller word v, i.e., there is no integer k > 2 such that u = v k . 
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Proposition 3.19. Let w be a recurrent sequence over A. If there is a constant C > 1, 
a sequence (u n ) ne ^ of primitive words in Fac(w) and an increasing sequence (k n ) n( z N of 
positive integers such that (\u n \) ne n is increasing and such that for all n, all integers i with 
< i < k n belong to Pow(u n , Fac(w)), then w does not have a sub-linear complexity. 

Before proving the result, let us recall the classical following result (see for instance 
Proposition 3.1.2 in [Lot97]). 

Proposition 3.20. If x and y are two non-empty words such that xy = yx, then there is a 
word v smaller than x and y such that x = v n and y = v m for some positive integers n and 
m. 

Proof. Let us give a lower bound on the number of factors of length fc n |M n | of w. When C 
is at least equal to 2, the property is also true with 2 — e instead of C for all e > 0, thus, 
without loss of generality, we assume C < 2. 

Let n be a positive integer and let i in {0, 1, . . . , k n — | ^r] }. By hypothesis the word 
w n n is a factor of w. 

Let us study the return words to u^"~\ Since u n is a primitive word, a return word r to 
u^ n ~ l is either u n or has length |r| > (k n — i)\u n \. Indeed, if not, one of the following holds 
true: 

1. |r| = j\u n \ with 2 < j < k n — i; in this case, we have r = u J n which is not a return word 
to u n ; 

2. |r| < (k n — i)\u n \ and |r| is not a multiple of \u n \; in this case, there is a non-empty 
word u' smaller than u n such that u'u n = u n u' and, by Proposition 3.20, u n is not a 
primitive word. 

Now, since k n — i belongs to Pow(w n , Fac(w)) and w is recurrent, there are two words pt 
and Si such that PiU^~ l Si belongs to Fac(w), \pi\ < \u n \, |sj| < \u n \ and u n does not admit 
Pi as a suffix neither Si as a prefix. Thus, from what precedes, pi is a suffix of a return word 
to u k ™~ % with |rj| > ik n — i)|w n | and such that riU^ n ~ l Si is a factor of the recurrent word 

w. 

By hypothesis, we have C < 2 and k n — i > \ytf\ so it comes 

Ir^""*! > 2 

Let Vi denote the suffix of length fc n |w n | of r^M^" -1 and, when % > 1, let Xi denote a factor 
of w of length {k n + i)\u n \ that admits i>jSj as a prefix. Let also x$ = For a given word 
w — Wf ■ w\ w \, we let w[r : s] denote its factor w r ■ ■ ■ w s -\. By definition of return words, 
the word u^ n ~ l occurs only once in V{. Thus all the words 

Xi[i : t + k n \u n \], t e {1, 2, ... , i\u n \}, 

are different. 



~c 



\u. 
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Let us show that if i and j belong to {1, . . . , k n — [^1 }, i < j, then all words 

Xi[£ : £ + k n \u n \], £ G {1,2, . . . ,i\u n \} 
Xj[m : m + k n \u n \), m G {1,2, ... ,j\u n \} 

are also distinct (by construction they are not equal to xq). Suppose by contrary that there 
are some integers £ G {1,2,..., z|w„|} and m G {1, 2, . . . , j|w„|} such that 

Xi[£ : £ + k n \u n \) = Xj[m : m + k n \u n \] = X. 

Since £ < i\u n \ and m < j\u n \, there are some words a, (3, 7 and 5 such that 

X = au^P = 7u^ n ' j 5. 

We have \a\ > \j\ otherwise w^" - -' would occur twice in Vj. Let us show that we also have 
|a| < |t| + (kn - j)\u n \ - \u n \. Ask n -i>k n -j> > \, we have 

(k n - i)\u n \ > (k n - j + l)\u n \ > ^ n ^ n + \u n \- 

Thus, if |a| > I7I + (k n — j)\u n \ — \u n \, we have 

\a\ + (k n - i)\u n \ > |7| + (k n - j)\u n \ - \u n \ + (k n - j + l)\u n \ 
> \j\ + k n \u n \, 

which is a contradiction because \au kn ~ l f3\ = k n \u n \. 
We have just proved that 

|7| < |a| < |7| + (k n - ])\u n \ - \u n \ = Iju^I - \u n \. 

This implies that, if p is the prefix of length (\a\ — \ mod \u n \ of u n , we have u n p = pu n . 
Since \a\ — I7I 7^ mod \u n \ (otherwise Sj would be a prefix of u n ), Proposition 3.20 implies 
that u n is not primitive, hence a contradiction. 

Now we can conclude the proof: since all words Xi[£ : £ + k n \u n \], i G {1, . . . , k n — \^f \ }, 
£ G {1, 2, . . . , ^|w n |}, are distinct and different from xq, we have 

kn ~\ir~\ 
P w (k n \u n \) > 1+ ^2 

i=i 
£=±k -1 

> \u n \ 2J i 

and we deduce that the complexity is not sub-linear. □ 
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4 Beyond linearity 



Until now, we have provided several examples showing that various natural approaches 
to characterize S-adic sequences that have a sub-linear complexity does not appear to be 
promising. To conclude this paper, we raise a new problem related to S-adicity and, more 
precisely, to everywhere growing S'-adic sequences. 

For purely morphic sequences, the complexity function can have only 5 asymptotic be- 
haviours and only depends on the growth rate of images (see Theorem 2.2). For S'-adic 
sequences we have seen in Section 2.2 that things are highly more complicated. However, it 
has been proved in [Fcr96] (see also [Lerl2b, LR]) that any uniformly recurrent sequence with 
sub-linear complexity is everywhere growing S-adic with Card(S) < +oo. This is a kind of 
generalization of the third point of Theorem 2.2. Moreover, one can check that all examples 
considered in previous sections (and more generally all S-adic representations of well-known 
families of sequences such as codings of rotations, codings of interval exchanges, etc.) are 
everywhere growing. It is also interesting to note that for purely morphic sequences, the 
class of highest complexity 0(n 2 ) can be reached only by morphisms with bounded letters 
(still Theorem 2.2). Furthermore, up to now, Cassaigne's constructions (Proposition 2.9) are 
the only ones that allow to build S-adic sequences with arbitrarily high complexity and they 
admit several bounded letters. Consequently, the fact that the length of all images tends to 
infinity with n seems to be important to get a reasonably low complexity. 

Observe that, given an S-adic sequence w, the everywhere growing property is not a 
necessary condition for w to have a low complexity. Indeed, Cassaigne's constructions also 
hold for sequences with low complexity. One can also think to the Chacon substitution g 
defined by g(0) = 0010 and p(l) = 1 whose fixed point f? w (0) has complexity p(n) = 2n+l for 
all n (see [Fer95]). However, the existence of an everywhere growing S'-adic representation 
of w could be necessary (this is the case for the Chacon substitution and for any uniformly 
recurrent sequence with sub-linear complexity). 

That property is neither a sufficient condition since the sequence w 7i £ of Example 3.5 
satisfies it and does not always have a sub-linear complexity. However the following question 
seems to be natural. 

Question 4.1. Is it possible to reach any high complexity with everywhere growing S-adic 
sequences? 

This question seems to be a new non-trivial problem. Proposition 4.2 below provides a 
partial answer to that question. Indeed, it deals with expansive S-adic sequences, i.e., with 
S-adic sequences such that for all morphisms o in S and all letters a, we have \o~{a)\ > 2. 
The proof can be found in [Lcrl2b] and involves techniques similar to those used in [ELR75] 
for DOL systems. 

Proposition 4.2. Ifw is an expansive S-adic sequence with Card(S) < +oo ; then p^in) G 
0(n logn). 

Example 4.3 shows that this bound is the best one we can obtain. 
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Example 4.3. Let (3 be the morphism 




H> 0120 



2 H> 222 



1 i ^ 11 



and consider its fixed point w = 1^(0). It can be seen as an expansive {$}-adic sequence 
and we know from Theorem 2.2 that p w (n) = 0(nlogn). 
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